11 research outputs found

    Enhanced spatial pyramid matching using log-polar-based image subdivision and representation

    Get PDF
    This paper presents a new model for capturing spatial information for object categorization with bag-of-words (BOW). BOW models have recently become popular for the task of object recognition, owing to their good performance and simplicity. Much work has been proposed over the years to improve the BOW model, where the Spatial Pyramid Matching (SPM) technique is the most notable. We propose a new method to exploit spatial relationships between image features, based on binned log-polar grids. Our model works by partitioning the image into grids of different scales and orientations and computing histogram of local features within each grid. Experimental results show that our approach improves the results on three diverse datasets over the SPM technique

    Improving face gender classification by adding deliberately misaligned faces to the training data

    Get PDF
    A novel method of face gender classifier construction is proposed and evaluated. Previously, researchers have assumed that a computationally expensive face alignment step (in which the face image is transformed so that facial landmarks such as the eyes, nose, chin, etc, are in uniform locations in the image) is required in order to maximize the accuracy of predictions on new face images. We, however, argue that this step is not necessary, and that machine learning classifiers can be made robust to face misalignments by automatically expanding the training data with examples of faces that have been deliberately misaligned (for example, translated or rotated). To test our hypothesis, we evaluate this automatic training dataset expansion method with two types of image classifier, the first based on weak features such as Local Binary Pattern histograms, and the second based on SIFT keypoints. Using a benchmark face gender classification dataset recently proposed in the literature, we obtain a state-of-the-art accuracy of 92.5%, thus validating our approach

    3D face recognition using multiview keypoint matching

    Get PDF
    A novel algorithm for 3D face recognition based point cloud rotations, multiple projections, and voted keypoint matching is proposed and evaluated. The basic idea is to rotate each 3D point cloud representing an individual’s face around the x, y or z axes, iteratively projecting the 3D points onto multiple 2.5D images at each step of the rotation. Labelled keypoints are then extracted from the resulting collection of 2.5D images, and this much smaller set of keypoints replaces the original face scan and its projections in the face database. Unknown test faces are recognised firstly by performing the same multiview keypoint extraction technique, and secondly, the application of a new weighted keypoint matching algorithm. In an extensive evaluation using the GavabDB 3D face recognition dataset (61 subjects, 9 scans per subject), our method achieves up to 95% recognition accuracy for faces with neutral expressions only, and over 90% accuracy for face recognition where expressions (such as a smile or a strong laugh) and random faceoccluding gestures are permitted

    SIFTing the relevant from the irrelevant: Automatically detecting objects in training images

    Get PDF
    Many state-of-the-art object recognition systems rely on identifying the location of objects in images, in order to better learn its visual attributes. In this paper, we propose four simple yet powerful hybrid ROI detection methods (combining both local and global features), based on frequently occurring keypoints. We show that our methods demonstrate competitive performance in two different types of datasets, the Caltech101 dataset and the GRAZ-02 dataset, where the pairs of keypoint bounding box method achieved the best accuracies overall

    Improving Bag-of-Words model with spatial information

    Get PDF
    Bag-of-Words (BOW) models have recently become popular for the task of object recognition, owing to their good performance and simplicity. Much work has been proposed over the years to improve the BOW model, where the Spatial Pyramid Matching technique is the most notable. In this work, we propose three novel techniques to capture more re_ned spatial information between image features than that provided by the Spatial Pyramids. Our techniques demonstrate a performance gain over the Spatial Pyramid representation of the BOW model

    Development of a music organizer for children

    Get PDF
    Software development for children is challenging; children have their own needs, which often are not met by ‘grown up’ software. We focus on software for playing songs and managing a music collection—tasks that children take great interest in, but for which they have few or inappropriate tools. We address this situation with the design of a new music management system, created with children as design partners: the Kids Music Box

    Kid's Music Box: A Digital Music Organizer Designed with Children for Children

    Get PDF
    This thesis is an investigation of software development for children aged six to ten years old. This is a challenging area; despite the trend for children to be exposed to computer technology at an earlier age, they often struggle to utilize it effectively. Children are not merely miniature adults; they have their own needs which are often not met by traditional software. In particular, this thesis focuses on the task of music management: a task that children take much interest in but for which they are given few tools. We address this with the design of a new music management system: the Kid's Music Box. The development of Kid's Music Box is documented in four main parts: background research, requirements gathering, design and implementation, and evaluation. Background research identifies the strengths and weaknesses of conventional music organizers with respect to young users. Requirements gathering took the form of a focus group study, which aimed to overcome the distinct challenges of obtaining input from children. The design of Kid's Music Box builds on this work, by incorporating functionality, metaphors and design elements that suit the needs and capabilities of young children. Expert evaluations and formal evaluation from children users showed promising results, which concluded that Kid's Music Box is better than other organizers in terms of managing music for children

    Improving Bags-of-Words model for object categorization

    Get PDF
    In the past decade, Bags-of-Words (BOW) models have become popular for the task of object recognition, owing to their good performance and simplicity. Some of the most effective recent methods for computer-based object recognition work by detecting and extracting local image features, before quantizing them according to a codebook rule such as k-means clustering, and classifying these with conventional classifiers such as Support Vector Machines and Naive Bayes. In this thesis, a Spatial Object Recognition Framework is presented that consists of the four main contributions of the research. The first contribution, frequent keypoint pattern discovery, works by combining pairs and triplets of frequent keypoints in order to discover intermediate representations for object classes. Based on the same frequent keypoints principle, algorithms for locating the region-of-interest in training images is then discussed. Extensions to the successful Spatial Pyramid Matching scheme, in order to better capture spatial relationships, are then proposed. The pairs frequency histogram and shapes frequency histogram work by capturing more redefined spatial information between local image features. Finally, alternative techniques to Spatial Pyramid Matching for capturing spatial information are presented. The proposed techniques, variations of binned log-polar histograms, divides the image into grids of different scale and different orientation. Thus captures the distribution of image features both in distance and orientation explicitly. Evaluations on the framework are focused on several recent and popular datasets, including image retrieval, object recognition, and object categorization. Overall, while the effectiveness of the framework is limited in some of the datasets, the proposed contributions are nevertheless powerful improvements of the BOW model

    The US Program in Ground-Based Gravitational Wave Science: Contribution from the LIGO Laboratory

    Get PDF
    Recent gravitational-wave observations from the LIGO and Virgo observatories have brought a sense of great excitement to scientists and citizens the world over. Since September 2015,10 binary black hole coalescences and one binary neutron star coalescence have been observed. They have provided remarkable, revolutionary insight into the "gravitational Universe" and have greatly extended the field of multi-messenger astronomy. At present, Advanced LIGO can see binary black hole coalescences out to redshift 0.6 and binary neutron star coalescences to redshift 0.05. This probes only a very small fraction of the volume of the observable Universe. However, current technologies can be extended to construct "3rd Generation" (3G) gravitational-wave observatories that would extend our reach to the very edge of the observable Universe. The event rates over such a large volume would be in the hundreds of thousands per year (i.e. tens per hour). Such 3G detectors would have a 10-fold improvement in strain sensitivity over the current generation of instruments, yielding signal-to-noise ratios of 1000 for events like those already seen. Several concepts are being studied for which engineering studies and reliable cost estimates will be developed in the next 5 years
    corecore